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IN^SIXU METHOD AND APPARATUS FOR END POINT DETECTION 
IN CHEMICAL MECHANICAL POLISHING 

Brief Description of the Invention 
5 The present invention relates to an in-situ method and apparatus for end point 

detection during chemical mechanical polishing, and more particularly to a method and 
apparatus in which localized areas of the surface of a semiconductor wafer or substrate 
which is undergoing chemical mechanical polishing are monitored to detect the removal 
of material from the localized wafer surface areas. 
10 Related Applications 

The present invention is related to co-pending US patent application Serial No. 

(Attorney Docket No. A-69175/MSS) filed simultaneously herewith and is 

incorporated by reference in its entirety. The present invention further claims the benefit 
of US provisional patent application Serial No. 60/258,931, filed December 29, 2000, 
15 which is incorporated herein by reference in its entirety. 

Relevant Literature 

The following literature references describe chemical mechanical polishing and 
various prior art end point detecting techniques. 
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5 Background of the Invention 

Manufacture of semiconductors has become increasingly complex as the device 
densities increase. Such high density circuits typically require closely spaced metal 
interconnect lines and multiple layers of insulating material, such as oxides, formed atop 
and between the interconnect lines. Surface planarity of the semiconductor wafer or 

1 0 substrate degrades as the layers are deposited. Generally, the surface of a layer will have 
a topography that conforms to the sublayer, and as the number of layers increase the non- 
planarity of the surface becomes more pronounced. 

To address the problem, chemical mechanical polishing (CMP) processes are 
employed. The CMP process removes material from the surface of the wafer to provide 

15 a substantially planar surface. More recently, the CMP process is also used to fabricate 
the interconnecting lines. For example, when depositing copper leads or interconnect 
lines, a full layer of the metal 13 is deposited on the surface of the wafer 10 having 
grooves 12 formed in an oxide layer 1 1 as shown in Figs. 1 A and IB. The metal layer 13 
may be deposited by sputtering or vapor deposition or by any other suitable conventional 

20 technique. The oxide layer, such as doped or undoped silicon dioxide, is usually formed 
by chemical vapor deposition (CVD). The metal layer covers the entire surface of the 
wafer and extends into the grooves. Thereafter, individual leads 16 are defined by 
removing the metal layer from the surface of the oxide. The CMP process may be used 
to remove the surface metal leaving the leads 16 in the grooves. The leads are insulated 

25 from one another by the intervening oxide layer. 

In general, to carry out the CMP process, a chemical mechanical polishing (CMP) 
machines is used. Many types of CMP machines are used in the semiconductor industry. 
CMP machines typically employ a rotating polishing platen having a polishing pad 
thereon, and a smaller diameter rotating wafer carrier which carries the wafer whose 

30 surface is to be planarized and/or polished. The surface of the rotating wafer is held or 
urged against the rotating polishing pad. A slurry is fed to the surface of the polishing 
pad during polishing of the wafer. 
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It is desirable to precisely determine when the material has been removed from the 
upper surface of the wafer during the CMP process. This not only prevents discarding of 
* over-polished wafers, but also minimizes the necessity of re-polishing any under-polished 
wafers. There are many possible ways of determining when to stop the CMP process. 

5 Typical methods include: (1) detecting fiictional change as the top layer of metal is 
polished away to expose the silicon oxide layer by monitoring the current to the platen 
and carrier motors, and (2) monitoring thermal and acoustic signatures from the polishing 
pad. Electfical impedance, conductance and capacitance can also be used to determine 
the presence of the metal layers. 

10 More recently, optical measurement has been used in the art with the CMP 

process. For example, U.S. Patent No. 5,838,448 uses interferometry and describes 
detecting the thickness of a thin layer, or the changes in the film thickness, by measuring 
reflectance variations caused by a change in the incidence angle of incident light. U.S. 
Patent No. 5,835,225 describes using reflectance measurements to determine a particular 

15 surface property of the substrate. U.S. Patent No. 5,433,651 describes a method and 
apparatus for viewing the wafer during polishing and end-pointing the CMP process when 
a prescribed change in the in-situ reflectance corresponds to a prescribed condition of the 
polishing process. 

While these techniques have provided improvements to the CMP process, these 
20 methods provide average (global) characteristics of the whole wafer surface, rather than 
those of smaller, localized regions or areas of the wafer. This means that, although one 
part of the wafer may get polished before another, the global system is not typically able 
to differentiate between over-polished and under-polished regions of the wafer. 

In another prior art technique, as described in U.S . Patent No. 5 ,972,787, indicator 
25 areas are provided on the wafer. These indicator areas are formed of blocks of parallel 
metal lines with varying line widths and pattern factors that are chosen to violate existing 
ground rules in such a way that they will be dished out using the standard consumable set 
(pad/slurry) of a given metal CMP process. The blocks are then inspected to determine 
the extent of polishing. While this technique provides for indicating the polishing in 
30 certain areas of the wafer, the process requires that the CMP step be interrupted for the 
inspection to take place. Further, the indicator areas require formation of the blocks 
which add an additional step to the already complex fabrication process. 
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In addition, the copper (Cu) damascene process is emerging as a critical 
technology to produce high-speed, high-performance, and low energy-consuming Ultra- 
* Large-Scale Integrated (ULSI) circuits. In copper damascene, the CMP process is 
employed to remove the excess copper and barrier materials (typically Ta, Ti, TaN or 

5 TiN) and to form interconnects inside the trenches in the inter-layer dielectric (ILD, 
typically Si0 2 or polymers). The copper damascene process adds additional complexities 
to the CMP process. It has been reported that the material removal rate of Cu strongly 
depends on the pattern geometry. The nonuniform pattern layout usually causes 
nonuniform polishing across the die area, and results in partial overpolishing on the area 

10 with higher Cu fraction and dishing on the soft Cu lines. The Cu loss and surface 
nonuniformity due to overpolishing and dishing may affect the reliability of interconnects 
and must be minimized. Additionally, the nonuniformity of initial Cu coating, the spatial 
variation of the process parameters (velocity, pressure, slurry transport, etc.), and the 
process random variation will increase the within-wafer and within-lot nonuniformity of 

1 5 polishing. These result in a variation of the completion time, or the endpoint, of the Cu 
CMP and impact the process yield. In order to reduce the variance of polishing outputs 
(uniformity, overpolishing and dishing), it is desirable to integrate an in-situ sensing and 
endpoint detection technique with the process optimization schemes to improve process 
performance. 

20 The wafer-level endpoint for the copper CMP process may be defined as the time 

when the excess Cu and barrier layers are fully cleared up on a specified number (or 
percentage) of dies of a wafer. Due to the polishing nonuniformity, all the dies on a wafer 
generally will not reach the endpoint at the same moment, and some of the dies may be 
overpolished. Thus the endpoint of CMP can be a representation of the optimal polishing 

25 time at which the number of out-of-specs dies (either under- or over-polished) reaches a 
minimum and the process yield is maximized. However, the remaining Cu thickness on 
each die area is difficult to measure in real-time to determine the endpoint. Most of the 
prior art insitu sensing techniques rely on indirect methods to detect the moment of 
Cu/barrier clear-up, such as the changes in the friction force, the ion concentrations of the 

30 Cu/barrier materials, and. the electrical impedance on the surface. However, these 
methods are limited due to the lack of reliability and the problem of high noise-to-signal 
ratio in practical applications. Moreover, all these techniques provide only average 
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information over a relative large area (usually wafer-level) and lack the capability of 
sensing within-wafer and die-level uniformity. Therefore, these methods can just be used 
* as supplementary methods with other primary metrology to assure the detection of 
endpoint. 

5 Recently, the capability of a photoacoustic technique on the thickness 

measurement of multi-layer stacked films has been investigated. Two optical excitation 
pulses are overlapped on the surface of the coating to form an interference pattern. 
Absorption of light by the film generates counter-propagating acoustic wave. By 
measuring the acoustic frequency, the film thickness can be calculated. However, this 

1 0 method is limited to a blanket area with the dimensions much larger than the beam size. 
It is difficult to model the generation and the propagation of the acoustic wave in thin Cu 
film on the patterned area. Hence, this method is currently limited to the measurements 
for blanket wafers or large patterns which can be simulated as blanket areas. 

Among all the endpoint detection techniques, optical sensing techniques may 

1 5 prove to be the most successful. Interferometry technology is employed to measure the 
film thickness based on the interference of light from the surface of the top and the 
underlying layers. This may be suitable for measuring transparent films such as dielectric 
layers, but not effective for opaque metal films. In theory, the reflectance measurement 
may be used for detecting the surface topography and the metal area fraction on the 

20 surface. Moreover, because the reflectance of patterned surface is influenced by the 
topography of the pattern, it may also be possible to gain information on surface planarity 
and dishing by this metrology. While the reflectance technique holds promise, 
significant development is needed to provide a practical end point detection system and 
method. 

25 Accordingly, there is a need for an improved method and apparatus that can 

continuously, and in-situ, monitor localized regions of the wafer surface during the CMP 
process. 

Summary of the Invention 
It is an object of the present invention to provide an in-situ method and apparatus 
30 for monitoring localized regions of the wafer surface during the CMP process. 

It is another object of the present invention to provide a method and apparatus 
which continuously monitors the polishing progress at different areas of the wafer, and 
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may also be used to determine the end point for removal of material from the surface of 
the wafer. 

It is a further object of the present invention to provide a method and apparatus 
which employs the difference in reflectance between different materials on a wafer to 

5 monitor the polishing progress and/or end point at selected regions on the wafer surface. 

It is yet another object of the present invention to provide a method and apparatus 
which monitors reflectance at various surface areas of the wafer and controls the polishing 
process at said areas to achieve substantially uniform removal of metal during polishing. 
It is an even further object of the present invention to provide an in-situ method 

10 and apparatus for monitoring surface conditions and detecting the process endpoint for 
cooper damascene CMP. 

The foregoing and other objects of the invention are achieved by a chemical 
mechanical polishing method and apparatus in which a rotating polishing platen and 
polishing pad of a first diameter polishes a wafer carried by a wafer carrier. A window 

15 is formed in the polishing platen and pad whereby said window periodically scans across 
the underside of the wafer. An optical detector, such as a fiber optic cable, transmits light 
through the window onto the surface of the carrier and receives light reflectance through 
the window from said wafer surface as it rotates past the window and means are provided 
for monitoring the reflected light, and for controlling the polishing process at localized 

20 regions of the wafer responsive to the reflected light information. 

More specifically, the chemical mechanical polishing method and apparatus 
includes a wafer carrier that has a membrane having a central and concentric pressure 
chambers or compartments which define corresponding zones or regions on the wafer 
surface. An actuator is provided to control the pressure applied to the central and 

25 concentric compartments and thereby control the rate of removal of material from the 
wafer surface at each of the corresponding zones, and the actuator is engaged responsive 
to reflected light received at each of the zones. 

In another aspect of the present invention, a method of chemical mechanical 
polishing is provided comprising the steps of : providing a CMP machine which includes 

30 apolishing pad and a wafer carrier having multiple chambers that allow for independently 
varying pressure within the chambers that urge against a wafer at corresponding localized 
regions on the wafer; measuring the reflectance of the surface of the wafer during 
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polishing at each of the localized regions on the wafer; processing the reflectance data to 
determine the state of polishing within each of the localized regions; and independently 
- adjusting the pressure within any one of the chambers responsive to the state of polishing 
within each of the corresponding localized regions. 
5 Brief Description of the Drawings 

The foregoing and other objects and features of the invention will be more clearly 
understood from the following description when read in connection with the 
accompanying drawings in which: 

Figs. 1 A and IB show the surface of a wafer with a trenched oxide coating with 
10 conductive interconnect material applied to the surface, Figure 1 A, and polished, Figure 
IB, to leave leads. 

Fig. 2 is a top plan view of a rotating polishing platen and polishing pad with a 
wafer carrier and observation window in accordance with the present invention. 

Fig. 3 is a partial sectional view showing the rotating polishing platen, polishing 
1 5 pad and wafer carrier in accordance with the present invention. 

Fig. 4 shows the diaphragmed pressure pad of the wafer carrier associated with 
a metalized wafer in accordance with one embodiment of the present invention. 

Fig. 5 schematically shows the wafer surface with concentric annular areas and 
the path of the scanning window across the wafer according to the present invention. 
20 Fig. 6 is a schematic of the optical end point detection system according to one 

embodiment of the present invention. 

Fig. 7 shows the output voltage as a function of the gap between the end of the 
fiber optics bundle and the wafer surface for one exemplary embodiment of the present 
invention. 

25 Fig. 8 shows reflectance as a function of wavelength for various materials. 

Fig. 9 shows the reflectance as a function of wafer position at various polishing 
times for one exemplary embodiment of the present invention. 

Fig. 10 illustrates one example of actual reflectance as a function of time as 
compared to an ideal signal. 
30 Fig. 11 is a schematic block diagram of a control loop for one example of a 

chemical mechanical polishing apparatus that may be used with the present invention. 
Fig. 12 is a flow chart illustrating processing of the output signal from the 
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reflectance sensor for one embodiment of the present invention. 

Fig. 13 is a flow chart illustrating the control of pressure at the various wafer 
- zones in accordance with an alternative embodiment of the present invention. 

Figure 14 shows a schematic diagram of light scattered on a patterned Cu surface. 
5 Figures 15a and 1 5b show schematic diagrams of light scattered from (a) a planar 

composite surface, and (b) a wavy composite surface. 

Figure 16 illustrates sensor kinematics in accordance with one example of the 
present invention. 

Figure 17 shows the simulated locus for the reflectance sensor across the wafer 
10 at the condition W w -W p and r s -r cc . 

Figure 18 shows the simulated locus for the reflectance sensor across the wafer 
at the condition W w - 1 .05 W p and r s -r cc . 

Figure 19 shows the results of off-line measurements at the copper planarization 
regimes on the pattern with 0.5 area fraction {w/X = 0.5) in accordance with one 
1 5 embodiment of the present invention. 

Figure 20 shows the results of off-line measurements at the Cu planarization 
regime on the patterns with 0.01 area fraction (w/A, = 0.01) in accordance with another 
embodiment of the present invention. 

Figure 2 1 shows the time evolution of step-heights for patterns with constant area 
20 fraction 0.5 and 0.01 in accordance with experiments of the present invention. 

Figure 22 shows the results of off-line measurements at various process regimes 
on the pattern with 0.5 area fraction. 

Figure 23 shows the results of off-line measurements at various process regimes 
on the patterns with 0.01 area fraction. 
25 Figure 24 shows the time evolution of Cu dishing for patterns with constant area 

fraction 0.5 and various linewidths. 

Figure 25 shows the time evolution of Cu dishing for patterns with constant area 
fraction 0.01 and various linewidths. 

Figure 26 shows the off-line measurements of the mean and standard deviation of 
30 surface reflectance along different loci across the wafer at the onset of endpoint. 

Figure 27 shows a comparison of the off-line measurements (mean and standard 
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deviation) on the center die and across wafer at various polishing stages. The across- 
wafer data is calculated based on the measurements along five loci. 

Figure 28 shows raw data from in-situ reflectance measurements made according 
to examples of the present invention. 
5 Figure 29 shows the results of in-situ measurements of the moving average and 

standard deviation of wafer-level surface reflectance. 

Figure 30 shows the results of in-situ measurements of the standard deviation of 
wafer-level surface reflectance. 

Figures 31a to 31 e shows the distribution of surface reflectance versus polishing 
1 0 time from the in-situ measurements made according to examples of the present invention. 

Figure 32 shows the simulated loci for the reflectance sensor across the wafer at 
the condition W w -1.05w p and r==1.25r cc . 

Figure 33 shows the decomposition of the within-wafer and within-die variance 
for the in-situ measurements. 
15 Figure 34 shows the results of the sampled moving average versus time with 

estimated interval at 99.5% confidence interval. 

Figure 35 shows the results of in-situ measurements of the ratio of the standard 
deviation to the mean reflectance (wafer-level). 

Figure 36 shows the results of the range of surface reflectance versus polishing 
20 time (wafer-level). 

Figure 37 shows experimental validation for various in-situ sensing and endpoint 
detection schemes. 

Detailed Description of the Invention 
The inventors have discovered a method and apparatus for providing in-situ 

25 monitoring of the removal of materials in localized regions on a semiconductor wafer or 
substrate during chemical mechanical polishing (CMP). In particular, the method and 
apparatus of the present invention provides for detecting the differences in reflectance 
between different materials, such as conductive, insulating and barrier materials, within 
certain localized regions or zones on the surface of the wafer. The differences in 

30 reflectance are used to indicate that the top or bulk material has been removed in each of 
the localized zones. In the preferred embodiment this information is used to provide 
real-time control of the CMP process. 
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Specifically, referring to Figs. 2 and 3 is shown a portion of a CMP machine 
which includes a rotating platen 2 1 and a rotating wafer 22 carried by a wafer carrier (not 
* shown) in accordance with one embodiment of the present invention. The platen 21 
carries a polishing pad 23 onto which a polishing slurry is applied during the CMP 
5 process. The CMP machine in the present embodiment is employed to remove surface 
material, either a conductive or insulating material, from the surface of the wafer. In one 
embodiment, the surface material is a metal, and the metal is removed from the wafer 
surface to leave conductors imbedded in trenches in an insulating layer. The conductive 
material can be any suitable conductor such as aluminum or copper. The insulating 

10 material can be any suitable insulator such as un-doped silicon dioxide, silicon oxide 
doped with boron, phosphorous, or both, or low dielectric constant materials. Also, the 
present invention may be used to remove conductive or insulating materials to expose a 
barrier material, such as TaN and the like. Further, the barrier layer may also be removed. 
In one embodiment the present invention is directed to a method for detecting surface 

1 5 metal removal to fabricate a structure such as that schematically illustrated in Figure IB. 
The present invention exploits the reflective differences between the conductive 
(typically metal) and the insulating materials to monitor the progress of planarizing of the 
wafer, and to determine which localized regions are nearing removal of the material and 
thus the end point of the polishing process. 

20 To monitor the CMP process, the difference in reflectance between the conductive 

and the insulating materials are observed. The preferred conductive materials used for 
leads in semiconductor devices are aluminum and copper, which are approximately 90- 
95% reflective for light around one micrometer in wavelength. The reflectance as a 
function of wavelength for copper, aluminum, silicon and tantalum are shown in Fig. 8. 

25 Most insulating materials such as silicon oxide are, as can be seen from Fig. 8, 25-30% 
reflective at the same wavelength. This difference in reflectance is used to monitor the 
polishing process. During the CMP process, the pre-polished reflectance from the wafer 
surface is expected to be about 90% due to full coverage of metal on the surface of the 
wafer. Upon completion of the CMP process the post-polish reflectance is expected to 

30 be lower; in one example in the range of about 25-60%, because the exposed surface has 
a mixture of insulating material and the metal conductors in the trenches. It is important 
to note that the these numbers are given for general purposes only, and that the actual 
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different in reflectance between the conductive and insulating or barrier materials will 
vary primarily based on the type of material and on the pattern and pattern density on the 
* surface of the wafer. In general, lower the density of the metal lines on the patterned 
wafer, the lower the reflectance value. In one exemplary embodiment of the present 

5 invention, the difference in reflectance between the conductive material, and the 
reflectance value which indicates that the CMP process is nearing completion or is 
substantially complete at a given zone, is observed to be up to about 65%. Again, the 
actual difference in reflectance will vary dependent on a number of factors, such as for 
example the type of material, whether the material is in bulk or patterned, the pattern 

1 0 density, the wavelength of the light, and the surface finish of the wafer (which may reduce 
the reflectance). 

An optical detection system, preferably a fiber optic reflectance system, is used 
in the present invention. Referring to Figs. 3 and 6, one example of the present invention 
shows a bundle 26 of optical fibers which transmit light from a light source 27 such as a 

15 light-emitting diode, to a sensor tip 28. Other optical fibers in the bundle 26 transfer 
light reflected from the surface of the wafer to a photodetector 29 connected to an 
amplifier system 3 1 including an operational amplifier 32 and low pass filter comprising 
capacitor 33 and resistor 34. The analog output from the operational amplifier is applied 
to an analog-to-digital converter 36, and then to a processing system which processes the 

20 digitized signal in a manner to be presently described. Such an fiber optic system is 
commercially available, such as a Philtec D64 sensor system. 

In the preferred embodiment, the emitting and receiving fibers are in parallel and 
are randomly distributed in the bundle 26 and oriented generally normal to the wafer 
surface, although other orientations are acceptable. According to the present invention, 

25 the light-emitting diode is selected to emit light at a wavelength that maximizes the 
differences in reflection of the particular materials on the surface of the wafer. In one 
example, where a copper layer is to removed to reveal copper leads placed within 
intervening silicon dioxide layers, the light-emitting diode is selected to emit light at a 
wavelength of preferably about 880 nm, which is in the range having optimal differences 

30 in reflection. Those skilled in the art will recognize that the wavelength providing the 
most optimal difference in reflectance between the conductive and insulating materials 
will vary depending on the types of the materials, but that such wavelengths can be 
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determined based on the teaching of the present invention. 

The gap distance "g" between the sensor tip 28 and the wafer 22 is important to 
- minimize fluctuations in the reflectance readings. Accordingly, preferably the sensor 
holder of the present invention is designed to allow gap adjustment. In one example, the 
5 sensor holder is comprised of a rigid housing witli a nut which receives a threaded sensor 
tip that screws onto the nut and the gap between the sensor tip 28 and the wafer is 
adjusted up or down simply by twisting. Other sensor holder configurations may be used 
so long as they provide a rigid structure that allows adjustment relative to the wafer 
surface. 

10 Increasing the gap distance "g" can minimize the influence of gap changes as 

illustrated in Fig. 7 which shows the characteristics of the sensor of the exemplary 
embodiment. Specifically, each sensor will exhibit a certain voltage at a certain gap 
distance, as can be determined experimentally or may be available form the manufacturer 
of the sensor. It is preferred to select a gap distance where the slope of the curve flattens 

15 out. In the exemplary embodiment, using a Philtec sensor the gap distance "g" is 
preferably in the range of about 200 to 250 mils, and more preferably in the range of 
about 200 and 225 mils. While, one specific example is shown, other suitable sensors 
may be used to measure reflectance of a wafer surface. However, any suitable sensor 
must be capable of projecting light onto the wafer and gather the reflected light, and 

20 providing an output signal for processing. 

To provide in-situ monitoring of the CMP process, the method and apparatus of 
the present invention employs the sensor tip, inserted in at least one window 36 formed 
in the rotating platen, to view the wafer during polishing as shown in Fig. 3. The fiber 
optics bundle with the light emitting diode detector and amplifier are mounted for rotation 

25 with the platen. A suitable slip coupling (not shown) may be used to transmit the analog 
signals through a rotating interface to the analog-to-digital converter 36. More than one 
window may be formed in the rotating platen, each having a sensor tip inserted therein 
for viewing multiple locations at the same time. When using multiple sensors, sampling 
techniques known in the art may be used to process the signal. The window may be of 

30 any shape and size, and is limited only by being able to adequately house the sensor tip, 
an preferably provides a small footprint to minimize the impact on the polishing process. 
Of particular advantage, the window 36 may be placed in any desired location 
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such that it traverses a desired region of the wafer during polishing. In the preferred 
embodiment, the center-to-center offset distance of the wafer and the window are 
* selected such that the sensor tip views the wafer in a scanning arc which travels through 
the center of the wafer. The scan line 37 shown in Fig. 5 illustrates one example of the 
5 scanning arc which travels through the center of the wafer. The polishing may be axis- 
symmetric, and thus a measure of the reflectance intensity at a distance from the wafer 
center is expected to be the same for all zones of equal radii. In the instance when 
polishing is axis-symmetric, the polishing level can be inferred for all other radii in any 
annular zone, as long as the 3ensor traverses across the center of the wafer. 
10 Alternatively, different scanning arc trajectories may be selected by changing the 

center-to-center offset and/or by varying the rotational speeds of both the wafer carrier 
and the platen. For example, up to a 10% rotational speed offset (i.e. difference in speed 
between the wafer carrier and the platen) allows one to "step" the trajectory across the 
wafer. 

15 The optical detection system needs to be protected from the polishing 

environment. This is accomplished by providing the window(s) 36 in the polishing pad 
23, flush with or slightly recessed from the pad surface. Preferably, the window has 
similar wear properties as those of the pad thus preventing any damage to the surface of 
the wafer. 

20 Of significant advantage the present invention provides for monitoring the CMP 

process in certain localized regions or zones. In particular, a plurality of zones are 
defined on the surface of the wafer and correspond to zones formed in a membrane that 
engage the wafer. Preferably, the zones are annular; however, the zones may be formed 
of any suitable shape. Referring to Figures 4 and 5, one example of these zones are 

25 schematically illustrated, and are further described in co-pending application Serial No. 

(Attorney Docket no. A-69175/MSS) wherein a wafer carrier with 

compartmentalized membranes engages the upper surface of the wafer and urges the 
wafer across the polishing pad. In this example, the compartments or chambers are in the 
form of concentric rings and define annular zones whereby the pressure between the wafer 

30 and the polishing pad is controlled by these annular zones which are adj acent to the wafer. 
Thus, by varying the pressure in the annular zones, the rate of polishing on the wafer 
is controlled at localized regions on the wafer corresponding to each of the annular zones. 
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More specifically, as further described in the above referenced co-pending 
application, a wafer carrier is provided which includes a flexible membrane that engages 
* the wafer and urges or presses the wafer against the polishing pad. Fig. 4 schematically 
illustrates such a wafer carrier 41 which includes a membrane 42 having concentric 
5 compartments 43 formed therein and sealed which define the multiple chambers or 
cavities 46. The chambers 46 form concentric rings with a center chamber 47 surrounded 
by one or more outer chambers 48. These chambers are defined as annular zones or 
regions. Each of the chambers separately engage the undersurface of the wafer 22, and 
thus define localized regions on the wafer surface corresponding to the adjacent annular 

10 zones. The pressure applied to the wafer 22 is separately controlled by the pressure in 
each of the chambers as indicated the arrows P r P 4 in Fig. 4. The result is that concentric 
zones or regions 48 on the wafer surface can be polished at different rates by controlling 
the pressure in the corresponding chambers 46. Although four zones are shown in the 
figures, any suitable number of two or more zones may be defined. Further, the zones 

1 5 may be of a different shape and are not limited to an annular shape, although an annular 
shape is preferred for the outer zones. In the preferred embodiment, the membrane 
contains four chambers defining four zones, the four zones being comprised of one 
circular center zone and three annular concentric zones. 

As the sensor traverses across the wafer during polishing, it monitors the polishing 

20 progress in the area of the wafer corresponding to one or more of the concentric surface 
zones. Non-uniform removal of material on the wafer surface tends to occur in patterns 
concentric about the central normal axis of the wafer due to the rotation of the wafer 
during polishing. The sensor detects the condition of the wafer a given distance away 
from the center, and a similar reflectance measurement may be assumed for all equal 

25 radii. As described in further detail below, this information regarding the condition of the 
wafer surface in the different zones is transmitted to a control system to produce a control 
signal which then selectively controls the pressure in the corresponding chambers behind 
the wafer as needed to selectively reduce wafer level non-uniformity during the CMP 
process. 

30 Additionally, the sensor is sensitive to scattering effects due to topographic 

variations found on the surface material layer on the wafer, particularly when the surface 
material is copper, just before planarization or removal of the layer. These topographic 
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variations are expected to become more planar during polishing and prior to removal, 
resulting in an increased reflectance signal. According to one embodiment of the present 
* invention this information is used to ascertain the wafer surface planarity during 
polishing, and is then used to modify the process parameters to provide more effective 

5 and/or efficient polishing. Initially, low pressure gives better planarization and as 
planarity is reached as indicated by an increased reflectance signal, the process may be 
modified to higher pressure and velocity to give an increase in removal rate. Thus, the 
overall polishing time may be reduced . Thus, the present invention provides a method 
and apparatus for providing feedback control to adjust the CMP process parameters, in 

10 addition to monitoring the CMP process. 

In another aspect of the present invention, the desired end-point of the CMP 
process is detected in-situ during polishing. A variety of methods may be used to monitor 
the CMP process and to determine the end-point. In one example, the end point of the 
CMP process is determined by comparing the sensor signal to a predetermined threshold 

1 5 value. Referring to Fig. 1 0, there is a comparison of the ideal signal and an actual signal 
obtained during removal of a metal coating (copper blanket wafers). It is seen that there 
is a measurable drop in reflectance as first, the conductive copper layer is removed, and 
second when the barrier layer is removed. Experimental results have shown a reasonable 
correlation between the ideal sensor signal and the actual sensor signal. Accordingly, a 

20 threshold reflectance value can be determined for each type of material and pattern type 
which can be used to compare against actual signals received during processing. When 
the threshold value is met in a given zone, pressure to the corresponding membrane 
chamber is reduced or removed to prevent further polishing in that region. 

Further, in addition to the threshold value, the entire pressure profile within each 

25 zone from the last wafer run can be used to control the next wafer. This control system 
is referred to as a "feed forward" or run-to-run" control system. This type of system 
assumes that the nest wafer to be polished will exhibit similar topology and material 
removal characteristic within the same location or zone as the previous wafer. Thus, a 
similar pressure profile is applied to the chambers to carry out a similar polishing process. 

30 Fig. 9 exhibits experimental results for tests conducted using the method and 

apparatus of the present invention. Wafers were polished having a blanket copper layer. 
The polishing took place until the blanket copper layer was removed to reveal a barrier 
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layer of TaN. Fig. 9 plots the reflectance received as a function of the wafer position (in 
inches) for multiple polishing passes in time (t) over the wafer. A number of 
* observations can be made. First, the material removal does occur substantially axis- 
symmetrically about the center of the wafer. The center of the wafer is the last localized 
5 region to be polished, and the edges of the wafer polish faster than the other regions of 
the wafer. This information can be used to create a pressure profile as described above, 
and sued to provide feed forward or run-to-run control. Specifically, the pressure is 
varied within each of the chambers corresponding to the localized position (i.e. zones) on 
the wafer to achieve the desired material removal. For example, the pressure in the 

10 outermost chambers which correspond to the edges of the wafer will be reduced at a 
selected time into the polishing process to account for the faster material removal rate in 
this region. The pressure may be reduced gradually, so that this region continues to be 
polished, but at a slower rate. Alternatively, the pressure may remain constant but will be 
at a lower value in this zone. Conversely, the center chamber which corresponds to the 

1 5 center position (or zone) of the wafer may receive increased pressure, the pressure may 
remain constant throughout the entire process, or a combination of both techniques may 
be used, since the center is the last zone to polish in this particular example. 

Fig. 11 shows a block diagram of one example of a control system that may be 
used with the present invention. The control system is comprised primarily of a process 

20 controller 50, pressure distribution controller 52, sensor 25, and a wafer database 54. The 
process controller 50 receives data establishing the process parameters or recipe, and 
sends commands to the CMP machine 56 to control the CMP process. Additionally, 
coupled to the process controller 50 and the CMP machine 56 is the pressure distribution 
controller 52 which controls the pressure within the membrane chambers in the wafer 

25 carrier as described^bove. 

The pressure distribution controller 52 receives data via two routes. First, the 
pressure distribution controller 52 may receive data representative of the reflectance 
measurements in each of the zones on the wafer directly from the sensor 25 . The pressure 
distribution controller 52 includes hardware and software configured to receive the 

3 0 reflectance measurements, determine the appropriate pressure adjustment needed (if any) 
within each zone, and then sends a signal to the CMP machine to selectively adjust the 
pressure within the subject zone as appropriate. The reflectance data from the sensor is 
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also transmitted to, and stored in, the wafer database 54. 

In an alternative embodiment, pre-determined pressure profile values and/or 
* threshold values for each of the zones are stored in the wafer database 54. These values 
are then transmitted to the process controller 50 or the pressure distribution controller 52. 
5 The pressure distribution controller compares these values to the actual, real-time 
reflectance values from the sensor 25 and sends a signal to the CMP machine 56 to adjust 
the pressure in each of the zones as appropriate. Additional data, such as the pre-polish 
thickness of the wafer 58 and/or the post-polish thickness of the wafer 60 may be sent to 
the wafer database to assist in determining the appropriate pressure adjustment. 

10 In another embodiment of the present invention, model based detection may be 

used to monitor and control the CMP process. Specifically, model based control provides 
for the real time adjustment of the CMP process parameters to better tailor the CMP 
process to the most effective and efficient process. The detection systems described 
above focus primarily on selectively controlling the pressure in the zones to provide for 

1 5 substantially uniform polishing of the localized regions of the wafer. This minimizes the 
occurrence of over-polishing in some regions and under-polishing in other regions. 

The model based detection and control system evaluates the amount of scattering 
in the reflectance signal received from the sensor. As described above, the inventors have 
found that the degree of scattering is indicative of the topography of the surface layer on 

20 the wafer. The extent of scattering of the signal may be evaluated based on statistical 
techniques such as determining the standard deviation and the variation in the mean as 
well as the shape of distribution. When a high level of scattering is seen the CMP 
process can be adjusted to give better planarization. As planarization proceeds, the 
surface layer the topographical variations begin to flatten out, and the scattering of the 

25 signal decreases. As this occurs the CMP process can again be adjusted to increase the 
removal rate of material from the surface of the wafer. These process adjustments can be 
made for example, by varying the relative velocity and applied pressure process 
parameters, and such adjustments can be made selectively within each of the zones as 
appropriate. Thus, the degree of scattering of the reflectance signal can used as an 

30 indicator of the material removal rate, and the polishing state of the wafer at certain 
localized regions on the wafer, and this information can be used to adjust the CMP 
process parameters. 
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In another aspect of the present invention, a method of chemical mechanical 
polishing is provided. In general, the method comprises the steps of: providing a CMP 

» 

* machine which includes a polishing pad and a wafer carrier having multiple chambers that 
allow for independently varying pressure within the chambers that urge against a wafer 

5 at corresponding localized regions on the wafer; measuring the reflectance of the surface 
of the wafer during polishing at each of the localized regions on the wafer; processing the 
reflectance data to determine the state of polishing within each of the localized regions; 
and independently adjusting the pressure within any one of the chambers responsive to 
the state of polishing within each of the corresponding localized regions. 

10 More specifically, in one embodiment the method of the present invention may 

be carried out as illustrated by the flowchart of Fig. 12. A CMP machine is provided and 
wafer polishing begins at step 100. The CMP machine includes means for varying the 
pressure against the wafer at localized regions, such as the flexible membrane having 
chambers that define zones on the wafer as described above. It should be noted however, 

1 5 that the present invention is not limited to this particular configuration, and other means 
that provide for independent control of the pressure at localized regions of the wafer may 
be used. 

To provide for localized control of the pressure, and therefore localized material 
removal rate on the wafer, the sensor position is monitored at step 110 using conventional 

20 means. The reflectance signal is measured and recorded at step 112. At step 1 14 the 
signal measurements are separated into zone. The reflectance signal for each of the zones 
is then processed at step 1 16a-l 16d. As described above, processing of the signal may 
be performed in a variety of ways. For example, the reflectance signal may be compared 
to a threshold value or to a pressure profile. Based on the output of the processing of the 

25 signalatsteps 116a-116b,adecisionismadeatstep regarding whether the pressure needs 
adjusting in any one of the localized zones. The inquiry is made for each of the zones at 
steps 1 16a-l 16d (four zones in the exemplary embodiment), and the pressure is reduced 
when the inquiry is positive at steps 1 18a-l 18d. 

Fig. 13 shows the method, particularly the processing step, in greater detail. The 

30 method begins at step 130 with polishing of the wafer at step 132. During polishing, the 
reflectance at various zones on the wafer is measured at step 134. The reflectance data 
measurements are separated or grouped into zones depending on the position of the sensor 
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when the data was gathered at step 136. The grouped data is then individually processed. 
In one example, the grouped data is processed to calculate the average reflectance in each 
• of the zones at step 138, data is stored at step 140, and a filtering average is obtained at 
step 142. The same reflectance data is also processed to calculate the standard deviation 
5 of the data in each of the zones, and to obtain the filtering average at steps 144 and 146. 
The standard deviation data is stored at step 148. The moving average values from both 
processing steps 142 and 146 are compared against previous, expected or threshold values 
at step 150. Tf the values do not differ in any of the zones, the polishing process continues 
without adjustment. If the values do differ in any one or all of the zones, the pressure in 

10 the zone(s) is independently adjusted accordingly at step 152. When all of the zones 
exhibit reflectance data that is indicative of end-point (as compared to previous, expected 
or threshold values) then the polishing process stops. 

In another aspect of the present invention, surface conditions on the wafer are 
determined, and in particular as shown in the exemplary embodiments, the surface 

15 conditions on blanket and patterned cooper wafers are evaluated. 

The scattering oflight by a periodic wavy surface as shown in Figures 14, 15aand 
15b has been investigated by many researchers (Rayleigh, 1907;Eckart, 1933;Beckmann 
and Spizzichino, 1963; Uretsky, 1965; Desanto, 1975 and 1981). The important 
formulations and their solution are reviewed in this section for the purpose of 

20 understanding the effects of pattern geometry on the surface reflectance by scattering. 
Consider the problem of plane wave being scattering by a periodic surface S 9 where z = 
h(x), as shown in Equation 1 . Let E x and E 2 denote the incident and scattered fields. The 
incident light (electric) field E h assumed to be unit amplitude, can be expressed as 

E\ =exp[(^ sin^x-A, cos^z)-^; (i) 

25 where kj is the wave number of the incident light wave (£, = 2it/X), 0j the incident angle, 
o) the angular frequency (co = 27$), and t the time. If only the scattered field at a fixed 
time is concerned, the exp(-ztftf) can be further suppressed for simplicity. The scattered 
field E 2 at any point P of observation above the surface is given by the Holmholtz integral 
(Beckmann, 1963) 
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with 

^=exp(V)/r (3) 

where r is the distance between die given observation point P and any point on the surface 
(x, h(x)) 9 and k 2 is the wave number of the scattered wave (k 2 = fcj = 2n/A). The point P 
is assumed in the Fraunhofer zone, i.e. r - > , to focus on the plane scattered waves 
rather than spherical ones. In order to solve the scattered field i^in Equation 2, the total 
field E and its normal derivatives/^ on the boundary surface must be can be specified, 
which can be approximated as ("Kirchhoff s method") 

(4 =(i+ r )£i ( 4) 

and 



=(l-r)£ 1 (k 1 -n) 



dn 



s 



= (1 -y)E x (*, sme x -k, co&J (5) 
ck 

where /is the reflection coefficient of a planar surface, and n is the unit vector normal to 
15 the surface at the interested point. The reflection coefficient ^depends not only on the 
local angle of incidence and the electrical properties of the surface material, but also on 
the polarization of the incident wave. For simplicity, the surface is assumed to be 
perfectly conducting, i.e. y= -1 for a horizontal polarization (electric vector perpendicular 
with the incident plane) for the following analysis. 
20 Equation 2 can be integrated over a specified periodic surface profile, such as a 

sinusoidal surface pattern. 

Z =Ux) =(Mi)cosfyvc/ A), (6) 
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where Ah is the half step-height and A the pitch of the features. The scattered field will 
.also follow the same period A along the x direction, which help simplified the integration 
term in Equation 2 by calculating the integration within one period instead of over the 
entire surface. Moreover, periodicity of the problem implies that the scattered field can 
5 be written as a superposition of the Fourier series representing the plane waves at different 
modes, in which the reflective (scattering) angle of each mode 0 2m follows the relation 
(grating equation). 

siitf^ =sin^ +mXIK (m=0,±l±2,... (7) 

The zero mode represents the condition of specular reflection, in which 0 2 = O h 
10 and the direction of the scattered plane wave will be away from the specular angle for 
larger m. The solution for the scattered field at the primary direction of each mode 0 2m 
at far field can be obtained by applying Equations 3, 4, 5, 6 and 7 into Equation 2 and 
performing integration over the surface (~L<k<L). The reflection coefficient /is written 
as a function of optical properties of the coating and the local angle of incidence to 
1 5 calculate the integration. The result can be normalized by the field reflected on a specular 
planar surface E 20 > which defines the scattering coefficient § (=E/E 20 ), and can be written 
as (Beckmann, 1963) 

mAJ--"*i 1+ T® + V (-*TUs)+CM) (8) 

COSt/j +cost/ 2m 

where J is the Bessel function, S =27E$l/ /\(cO$ +CO^), and ttj the residual parts of 

20 the ratio L/Al Equation 8 just gives the scattering coefficient at the primary scattering 
angle of each mode. For all the direction at angle 0 2 , the results is given as 

Insmpx* cos0j + cos0 2 |_ n o J 

25 where p = (L/A)(sm0j - sin6^ , S = 2Mft//i(cOS^ +COS0 2 ) and n the integer parts of the 

ratio L/A. In the far field (Fraunhofer zone, i.e. r-> ), only one mode of scattered 
plane wave can be observed at the given point P (in the direction of 0 2 ) 9 as shown in 
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Equation 1. As shown in Equation 1 in the near field, or the Fresnel zone, the total 
scattered field at P, normalized by E 20 , is given by superposing all the scattering modes 

i 

* contributed from the neighboring periodic surface. Both the amplitude and the direction 
of each mode, given by Equations 8 and 9, and the phase difference between each mode 

5 must be considered to calculate the total scattered field. In practice, the calculation of the . 
total scattered field may be complicated and needed to be performed numerically for the 
sensor located near the measured surface. It was shown that diffusion scattering takes 
place when the Ah/X ratio increases with constant pitch A (Brekhovskikh, 1952). Light 
will be scattered away from the direction of specular reflection, i.e., light is reflected into 

1 0 the direction of higher scattered modes (larger m), and will not be received by the sensor. 
Therefore the surface reflectance, which is proportional to the square of amplitude of the 
reflecting field, decreases with the step-height of the feature Ah with Ah comparable or 
larger than the wavelength of incident light. On the contrary, when the surface is 
planarized, i.e. Ah « 0, the surface reflectance will be close to that of a specular surface. 

15 Moreover, based on the law of energy conservation, the overall scattering coefficient <f> 
should be always equal or less than unity. 

It is noted that the number of possible modes m for the scattering field is limited 

by the condition that = S\V0 n should be less than unity. If 27i/kL (or ML) is close to 

unity, i.e. the wavelength is close to the waviness of the pattern, there will be one mode 
20 only and the surface will reflect specularly regardless its roughness. For the submicron 
Cu patterns employed in current design, the reflectance measured at the onset of process 
endpoint by a light source with comparable or larger wavelength essentially will indicate 
the Cu area fraction only. The slight surface topography due to overpolishing and dishing 
will not affect the reflectance significantly. As shown in Equation 2, the surface 
25 reflectance R, proportional to the square of the reflection coefficient, of the composite 
surface at the onset of endpoint therefore can be written as 

R=A f B Ctt +Q.-A f )B 0xid (io) 

whereby is the area fraction of Cu interconnects, and R cu and R oxide the reflectances of Cu 
and TEOS, respectively, in specular reflection. 
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The sensor loci on the rotating wafer surface can be determined by the relative 
velocity of the sensor to the wafer and the initial position of the sensor, as shown in 

i; 

- Equation 3. The relative velocity of the sensor on the rotating wafer can be obtained by 
the two steps: find the relative velocity of the sensor to the stationary X, Y coordinates 
5 fixed at the center of the wafer and then performing a coordinate transformation with 
respect to the wafer rotation. The velocity components for the sensor, v Xs , and v y „ and 
the wafer, v^, and v Yw , inX,Y coordinates can be expressed as shown in Figure 2. 

=-n^sin^+<9 0 )-r cc (iia) 

v Y j=r s G) p cos(p p t+0 Q ) (lib) 

10 and 

V* w =-Wil)0 . (12a) 

v y>w =# w fccos0-r cc ) (12b) 

where r s is the offset of the sensor from the center of the platen, the offset of the 
centers of the wafer and the platen, <o w and a> p are the angular velocity of the wafer and 
15 the platen, and 6>the angle of the sensor with respect to the ^coordinate. In addition to 
wafer rotation, in practice the wafer may translate relatively to the paten center, so called 

sweeping, with a velocity r cc to utilize the entire pad surface. For simplicity, it is 

assumed that the sweeping is along the X coordinate. Therefore, the components of the 
relative velocity of the sensor to the wafer, v XR and v rR , inX, 7 coordinates can be written 
20 as 

v x * = v xj -^ w ,=K» p sin^+6> 0 )-r cc ]+r A sin^ 

=-r s (tf> p -»Jsin^+0 o )-r cc (13a) 

and 



v y,r = V Y* -\ w =r s a p cosfy p t+0 o )-a) w (r s co&-r cc ) 
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=r s (co p -ajcosfyt+e^+a)^ (13b) 

These velocity components can also be represented in terms of a rotating 
coordinate system (x, y), with the original located at the center of the wafer and rotating 
at the same angular velocity co w as the wafer. The velocity components on the rotating 
coordinates, v XiR and v yR , are given by the coordinate transformation rule 

VJ r siri0) J cosw JI v y,r] K } 

and can be written as 

v x ,r =-r s (6) p ~a) w )sm((p p -a?J+0 o )+r cc (D w siiw^ (15a) 

V "^K "^w) c os(^, -o> w )t+0 o )+ r cc (D w cosDj+r cc smo)J (15b) 

1 0 Therefore, the displacement of the sensor on the wafer with respect to the rotating 

x, y coordinates is given by integrating the velocity in Equations 15a and 15b. 

x=\v xJi dt 

= -r s (a) p -a) w ) Jsin[^ -(o w )t+0 o ]dt+G) w jr cc swa)jdt- facosajdt (16a) 

15 =r s {a) p -o) w ) Jcos[fc) p -a> w )t+0 o ]dt+a) w jr cc coxojdt+ jr cc siwDjdt (16b) 

To solve Equations 1 6a and 1 6b for the position of the sensor on the wafer surface 
at a given time, a initial condition must be prescribed. It is convenient to assume that the 
sensor is initially located at the edge the wafer, with a initial angle 9 0 with respect to the 
fixed ^coordinate. For simplicity, it is also assumed that no sweeping motion occurs in 

20 polishing, i.e., r cc = 0 . In practice, the effect of sweeping motion on the sensor traj ectory 

across the wafer can be neglected if the sweeping velocity is much lower than the liner 
velocities of the wafer relative to the pad. With those assumptions, the position of the 
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sensor on the wafer can be expressed as 

x=r s cos[fy p -o)Jt+0 o ]-r cc cosoJ (17a) 

y = r s sii% -6) w )t +0 o ]+r cc sirwj (nb) 

As long as the condition x 2 ^ < r w (where r w is the radius of the wafer) is satisfied, 
5 the sensor is located inside the wafer/pad contact interface. Since the wafer is faced 
against the platen in polishing, the sensor trajectory given in Equations 16 and 17 is 
observed from the wafer back-side. The trajectory on the front surface is symmetric to 
the results from Equations 16 and 17 with respect to the;; axis. 

When the angular velocities of the wafer and the platen are equal, i.e. a> w = o p9 
10 Equations 17a and 17b can be further simplified and the locus of the sensor is an arc with 
the radius equals to r cc and centered at (r s cos&, rjsinO,) relative to the rotating x, y 
coordinates. 

(x-r s cos0 o ) 2 +(y-r 5 sin<9 0 ) 2 =r cc 2 . (18) 

When the angular velocities of the wafer and the platen are the same, the sensor 
1 5 enters the wafer/pad interface at the same point on the periphery of the wafer and always 
produces the same locus on the wafer surface, as shown in Figure 17. In practice, the 
angular velocity of the wafer must be slightly offset from the platen so that the sensor can 
scan over the entire wafer surface in different radial directions. Figure 18 shows the 
sensor loci for the conditions, a> w = 1.05 co p and r s = r cc , in which twenty identical loci 
20 start from twenty equally spaced points on the periphery of the wafer edge, respectively, 
and repeatedly if no wafer slippage occurs. As illustrated, the sampling density will be 
much higher at the center of the wafer, but lower at the edge where more dies are located. 
The lower sampling density on the edge dies might result in bias inference for the overall 
surface condition. How to design of sensor loci to sample enough data on desirable 
25 surface area will be discussed later in detail. 

The surface conditions of the wafer during polishing can be extracted from the real- 
time reflectance data. The statistics employed to infer the surface conditions include the 
maximum and minimum reflectance values, the range, the mean value, the variation, the 
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shape of the distribution of the reflectance data, etc. Three levels, including wafer-, die- 
and device- or subdie-level, of information can be obtained from the dataset. The spot 
* size of the sensor is so chosen that it is comparable or smaller than the subdie area but still 
much larger than dimensions of interconnects. Therefore, an individual measurement 

5 represents the reflectance on the specific device or pattern area on the wafer, from which 
the surface topography and Cu area fraction can be inferred. In reality, however, it is 
difficult to map the measurement results onto the exact location of a particular device or 
pattern because of wafer slippage inside the carrier. The individual datum only can be 
mapped onto the surface within a grossly defined area. Similarly, the die-level 

10 information may be obtained based on the samples along a specific segment(s) 
corresponding to the die location on the loci. However, it may only roughly represent the 
surface condition within the vicinity of the interested die region. Fortunately, the 
polishing results for the dies at the same radius to the wafer center very often exhibit the 
similar trend. Hence, data from within adjacent dies at the same radius sometimes may 

15 be combined to increase the sample size for the die at a particular radius to elucidate the 
spatial dependence of material removal in the radial direction. 

Moreover, the wafer-level information can be retrieved either from a single scan 
or multiple scans across the wafers. In the practice of endpoint detection, it is preferable 
to take enough samples from multiple loci so that the surface condition over a specific 

20 region or even on the entire wafer surface can be determined from this combined (or 
"pooled") dataset. The more loci are employed, the more uniformly the samples and the 
larger size of samples can be taken on the surface. Therefore, a higher level of inference 
can be achieved. The only concern is that the surface condition may change significantly 
during a long sampling period of multiple scans. This may affect the reliability of the 

25 inference and will delay decision making and feedback control. In order to eliminate this 
drawback, the moving average method is employed to estimate the average reflectance 
on the surface. The sensor scans across the wafer surface once per platen revolution. 
Suppose the reflectance sampled at the j-th point along the locus at the i-th time period, 
each time period is equal to the duration for one revolution of platen, is denoted as x tj . If 

30 total n points are taken along each locus, the mean reflectance along the locus at the i-th 

period, X h is given as 
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Suppose the number of loci to cover the entire wafer surface or a area of interest 
is w, the moving average of the sampling reflectance at the Uh period, M h is defined as 

w 

5 That is, at the i-th time period, the observations from the newest one scan and the 

previous (w-1 ) scans are employed to estimate the mean reflectance of the entire wafer 
or the surface of interest. Thus, the surface condition inferred from the reflectance 
measurements can be updated every scan. For example, it is about 10 scans to have the 
sensor cover the wafer at the condition of o w = 1.05o> p . If the platen runs at 75 rpm, it 

1 0 takes 8 seconds to scan over the entire surface, in which the locus rotates 180° relative to 
the wafer, and 1 6 second to rotate back to the first locus. The moving average can capture 
the change of surface reflectance due to both the change of surface topography and the 
change of Cu area fraction within a short period, in this case less one second. However, 
it may still smooth over the rapid change due to the partial oxide exposure on small 

1 5 portion of the wafer surface near the onset of endpoint by averaging the current data with 
the previous data (which is taken across 8 seconds in the example). 

On the other hand, the (total) variance of the surface reflectance at i-th time 

period,*^, can be estimated based on the same pooled dataset employed in the moving 
average. 

where AT is the total number of samples in the moving average subset (N = wn). The total 
variance is calculated based on the deviation of the reflectance at each sampling point 
relative to the total estimated mean of the entire wafer or the surface of interest, which is 
estimated by the moving average. In addition to the (total) variance, the variance along 
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each locus, the range of data, and their maximum and minimum must be tracked to assist 
identify the rapid change of surface reflectance at the moment when barrier or oxide layer 
• exposes. It can be employed to determine the percent overpolished area on the wafer 
surface at the end of the process. Additionally, the distribution of the data can be used 
5 to determine the regime of polishing. For example, the skewness of the data distribution 
in polishing can be compared with the theoretical value at end-point, which can be 
estimated based on the given pattern layout and sensor kinematics. The definition of 
skewness fi can be found in many statistics texts, and may be defined as (Sachs, 1982) 

P=^y J - (22) 

10 where X is the mean, X the median and S the sample standard deviation of the selected 
dataset, which can be estimated from one locus or multiple loci, which can be calculated 
from Equations 19, 20, and 21. These statistics can also be applied to the die-level 
estimation of surface condition. For instance, data taken within a specific range of radius 
(a ring region) can be combined, the same statistical methods can be employed to 

1 5 estimated the surface reflectance over the specific area. The effectiveness of each of these 
methods on endpoint detection will be examined in the discussion section. 

The following experiments are provided for illustration purposes only, and are not 
intended to limit the scope of the invention in any way. An optical sensor unit (Philtec 
D64) composed of light-emitting diodes (LEDs), bundled glass fibers for light 

20 transmission and receiving, and an amplifier was employed for detecting the conditions 
of the wafer surface based on the surface reflectance. The specifications of the sensor are 
listed in Table 1. 

Table 1 : Specifications of the reflectance sensor. 



25 



Item Specification 



30 



Light Source 


High Intensity LED 


Wavelength (nm) 


780-990 (>i=880, a=50) 


Spot Diameter (mm) 


1.6 


Light Beam Spread (°) 


30 


Operation Distance (mm) 


0-6.35 


Stability (%) 


<0.1% Ml scale 


Freauencv Response (kHz) 


<20 
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As shown in Figure 1 9, the spectrum of the LED light source ranges from 775 nm 
to 990 nm, with a mean around 880 nm and standard deviation about 60 nm. At the 
* sensor tip, the uncollimated light rays diverge outward from the transmit fibers, and only 
the reflected light within the area with the same diameter, about 1 .6 mm, of fiber bundle 

5 is received. The particular spot size was chosen so that it is small enough to detect 
different surface conditions on different patterns (sub-die areas) on the wafer. However, 
it is larger than the individual line or feature to even out the small variation of reflectance 
due to local (sub-device level) randomness of material removal. Because of the 
divergence of the light beam, the sensor is sensitive to the gap between the tip and the 

1 0 targeted surface. Figure 20 shows the characteristic of the sensor output (reflectance) on 
a mirror surface corresponding to the gap distance. In practice, the sensor was operated 
at a distance of around 5 mm so that the sensor response is less sensitive to the slight 
change of gap distance during polishing or the surface waviness of the wafer. 

The sensor unit was installed on the platen base with the tip embedded inside a 

1 5 holder through the platen. On the polyurethane polishing pad stacked on the platen, a 
translucent window made of plastic (Rodel JR111) was employed to enable the sensor 
view the wafer surface. The material of the window has similar wear properties as those 
of the pad so that the surface of the window remained at the same level of the rest of the 
pad surface and did not affect the sensor measurement or polishing uniformity. The 

20 sensor was linked to a power supply and a data acquisition system via the rotary coupling. 
The output signal was amplified before the coupling to enhance the signal to noise ratio. 
Additionally, an off-line set-up was employed to measure the surface reflectance of the 
polished wafer. Two rotary stages with angle readings were utilized to mimic the 
kinematics due to the rotation motion of the wafer carrier and the platen. The position of 

25 the sensor on the wafer were determined based on the angles of both the rotation of wafer 
and sensor arm and the distance between two centers of rotary stages. By comparing the 
measurements from the this set-up to those from in-situ sensing, the effect of slurry and 
wafer slippage on the reflectance sensing may be identified. 

Both blanket and patterned Cu wafers were employed for experiments to verify the 

30 capability of the sensor and to determine the detection schemes. The blanket Cu wafer 
was composed of a 20 nm TaN barrier layer and then followed by a 1 \xm thick PVD Cu 
coating on a Si substrate. For the patterned wafer, a tested damascene structure was 
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employed, which was composed of an array of line-spacing structures with different 
linewidths and pitches. A detailed floor layout of the pattern can be found in a previous 
-'chapter. This pattern is transferred into a 1.5 ^m thick TEOS coating with trenches 
etched to a depth of 1 \im on a 100 mm silicon substrate. A 20 nm Ta layer followed by 
5 a 1 nm thick PVD Cu layer was deposited on the top of the patterned oxide surface. The 
experimental conditions are listed in Table 2. 

Table 2: Experimental conditions. 



Experimental Paramaters 


Experimental Conditions 


Diameter of Wafer (mm) 


100 


Normal Load (N) 


391 


Normal Pressure (kPa) 


48 


Rotational Speed (rpm) 


75 


Linear Velocity (m/s) 


0.70 


Duration (min) 


1-6 


Sliding Distance (m) 


42 - 252 


Slurry Flow Rate (ml/min) 


150 


Abrasive 


a-Al 2 0 3 


Abrasive Size (nm) 


300 


M 


7 



20 

In this section, experimental results of blanket and patterned Cu wafers are 
examined to study the characteristics of the reflectance sensing technique. The 
reflectance of a planar Cu area measured in polishing may deviate from the theoretical 
value due to surface roughness, slurry particles, variation of gap between the wafer and 

25 the sensor in polishing, and random noise from various sources. Variation of surface 
reflectance due to these effects is studied based on the measurement on blanket wafer 
polishing. Additionally, the surface reflection in patterned wafer polishing is affected the 
surface topography in the planarization regime and by the area fraction in the polishing 
regime, which significantly contributes to the variation of measurements. Both off-line 

30 and in-situ measurements were conducted to study the effects of pattern geometry and Cu 
area fraction on the reflectance. These results are compared with the reflectance from 
light scattering theory with the assumptions of single wavelength, plane incident wave 
and periodic surface structure. The characteristics of reflectance across the wafer or a 
desired area during polishing are examined to correlate the measurements with different 

35 regimes of Cu CMP. These will help establish different schemes for in-situ sensing and 
endpoint detection. 
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Tests on the Blanket Wafers. 

A typical result of surface reflectance on a blanket Cu wafer during polishing is 
* shown in the figures. To elucidate the effects of slurry and scratching, the normalized 
mean reflectance is defined as the average reflectance over ten passes across the wafer 

5 divided by the reflectance on a scratch-free Cu wafer under the same pressure condition 
(at the same gap between the wafer surface and the sensor). At the initial stage, the 
reflectance was about 30% less than that without slurry. The reduction was due to the 
light scattering from slurry particles and the increase of gap distance resulting from the 
presence of slurry layer. Since the sensor was operated in the range in which it is less 

1 0 sensitive to the change of gap distance, the decrease of reflectance was mainly due to the 
particle scattering. The normalized mean reflectance gradually dropped 0.1 to about 0.6 
after 30 seconds of polishing and the standard deviation increased to about 0. 1 5 from the 
initial small value. These indicate that the surface was roughened due to particle abrasion. 
Thereafter the mean reflectance and the standard deviation remained at constant levels for 

15 about 3 minutes. After 4 minutes, the variation of the surface reflectance increased 
without change of the mean. Inspection of wafer surface at this stage indicated that a 
small portion of the Cu was cleared and the less reflective TaN was exposed on the 
surface. Since the majority of the surface was still covered with Cu, the mean did not 
drop significantly. Then, the mean started to drop and the variation kept increasing with 

20 the increase of the Cu clearing. Until the majority of Cu was cleared, about 6 minutes, 
the standard deviation kept decreasing and the mean gradually reached a lower level. The 
harder TaN barrier acted like a polishing stop and retained a low level of variation of 
surface reflectance after all the Cu is removed. After overpolishing for two more minutes, 
the TaN was polished through and the mean reflectance decreased further. 

25 Off-Line Measurements on Patterned Wafers. 

The effects of surface topography on reflectance are shown in Figures 19 and 20. 
These data were observed off-line on patterns at the center die with various linewidths and 
constant area fractions of 0.5 and 0.01, respectively. The normalized reflectance is 
defined as normalizing the measured reflectance on each sub-die by the reflectance on the 

30 unpolished blanket Cu surface. The corresponding step-height evolution for these 
damascene structures (sub-dies) is shown in Fig. 21 . To extend the planarization regime, 
lower nominal pressure (28 kPa) and relative velocity (0.46 m/s) were applied than those 
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of the industrial practice. By six minutes, most of the high features were removed and the 
surface had planarized before the Cu was polished through. For the patterns of 0.5 area 
* fraction, the initial variation of the reflectance resulted from the variation of step-height 
and pitch on the surfaces of different sub-die. Since the initial step-heights are close for 

5 the patterns with linewidth 2, 25 and 100 |xm, except that of the 0.5 \im structures, the 
reflectance is mainly affected by the pitch (or linewidth) of the pattern. The smaller the 
pitch, the more light scattering occurs on the surface and reduces the reflects This can 
be explained by the less reflective Cu surfaces on low features due to the coarse 
microstructure from the deposition process. After being polishing for two minutes, the 

10 normalized reflectance decreased, about 0.1, instead of increasing gradually with the 
reduction of step-height. This is because the surface roughness increased by particle 
abrasion and contributed to the overall reduction of the surface reflectance. The 
reflectance of the 0.5 |im line area, however, increased because the surface was mostly 
planarized before two minutes. 

1 5 The reflectance increased gradually for each pattern after the initial drop and then 

finally reached a steady value due to the planarization of high features. This trend has 
been explained in the theory section that the light is more likely to scatter into the 
direction of specular reflection and received by the adjacent receive fibers when the step- 
height decreases. As shown in Figures 22 and 24, the step-heights for various features 

20 were less than 100 nm after polishing for 5 minutes, and the normalized surface 
reflectance for various features reached a similar steady level, about 0.85, on the tested 
wafers. This implies that the employed optical sensing technique is less sensitive to the 
small variation of the surface topography. The reflectance for the patterns of 0.01 area 
fraction also dropped to about 0.1 due to the increase of surface roughness and then 

25 remained at the same level of 0.9 till the surface was planarized. Since the area fraction 
is small, the surface reflectance is not significantly affected by the evolution of the pattern 
topography, and the measurements are similar to those on a blanket Cu surface. 

Figures 22 and 23 show the trend of surface reflectance of various patterns, with 
0.5 and 0.0 1 area fractions, in the different process regimes - planarization, polishing and 

30 overpolishing. The corresponding evolution of dishing is shown in Figures 24 and 25, 
respectively. The pressure and the velocity applied was close to the industrial practice of 
48 kPa and 0.79 m/s. The surface topography was planarized on most of the patterns after - 
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1 minute of polishing and the normalized reflectance reached a similar level about 0.9 for 
all patterns tested. Between 1 and 3 minutes, the planar Cu layer was removed like that 
* in blanket Cu polishing and the normalized reflectance stayed the same constant about 0.9 
and was independent of original pattern geometry. After about 3 minutes, the reflectance 

5 dropped significantly and sharply because the Cu layer had been polished through and the 
less reflective underlying oxide appeared partially on the surface. Since the planarization 
rate is dependent on the pattern geometry, the sub-die areas with higher area fraction may 
Tiave been polished through faster. In Figures 22 and 23, the sub-die with high area 
fraction of 0.5 was polished through first and the Ta barrier was exposed after about 2 

10 minutes. Concurrently, the reflectance started to drop to about 0.8 when the Ta started 
to expose and then further down to about 0.5 when the oxide surface was exposed at 3 
minutes. Nevertheless, all tested patterns seemed to reach the onset of oxide exposure, 
between 2 and 3 minutes. 

After the onset of oxide exposure, the reflectance kept decreasing until all the 

15 excess Cu and barrier (Ta) materials were removed (i.e., process endpoint), after about 
four minutes of polishing. After the endpoint, the reflectance seemed to remain constant, 
regardless of the slight increase of topography due to dishing of the soft Cu lines and 
rounding and overpolishdng on the adjacent oxide regions. This again agrees with the 
earlier results in that the employed sensing technique is not sensitive to the small variation 

20 of the step-height. Hence, the variation of the reflectance in this regime was mainly due 
to the different area fraction of Cu interconnects. The areas with higher area fraction 
generally are more reflective. However, the experimental values were lower than those 
of theoretical prediction of reflectance for all patterns, especially for those with high area 
fractions. The theory predicts that the (normalized) reflectance is about 0.62 and 0.24 for 

25 the patterns with area fractions of 0.5 and 0.01, respectively, in which the Rteo^cu rat i° 
of 0.23 is assumed based on the experimental measurement on blanket films. In reality, 
the light transmitted through the oxide and reflected from the underlying Si substrate may 
be blocked by the Cu lines, which decreases the intensity of reflected light from the oxide 
surface arid reduces overall reflectance of the sub-die. Additionally, scratches and less 

30 reflective Cu oxides (due to corrosion) were found on the surfaces of Cu lines, which also 
resulted in a reduction of surface reflectance, especially for the pattern with more Cu area 
fraction. 
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Off-Line Measurements Along the Sensor Loci. 

The off-line measurements along different sensor loci in terms of the mean value 
- and the standard deviation are plotted in Figure 26. The wafer employed is the one shown 
in a prior section and polished for 4 minutes at normal conditions, in which the majority 
5 of dies have been polished to the end-point and some may have been overpolished 
slightly. The loci employed follow the sensor trajectories in polishing at the conditions 
of co w = co p and r s = r cc , in which the sensor travels along the arc of a radius r cc . Loci 
across different radial directions were employed to elucidate the effects of different loci 
on the statistics of the surface reflectance of the patterned wafer. It was found that the 

10 mean and the variation of reflectance data across wafer varied with the orientation of the 
locus. The mean value varied from 0.24 to 0.26 among the selected loci, compared to of 
the average reflectance about 0.25 of the center die. The standard deviation varied 
between 1 and 1.2, compared tp 1.8 in the center die. The variations of the mean and 
standard variation mainly resulted from the different sensor loci due to the non-axial- 

15 symmetric pattern layout and from the within-wafer nonuniform polishing. It is not 
uncommon that the within-wafer nonuniform polishing often exhibits an axial symmetric 
fashion, such as "bull's eye effect" (Stine, 1997). Therefore, the variations of reflectance 
between loci due to wafer-level nonuniformity may be comparable to that contributed 
from the pattern layout. 

20 Figure 27 shows the mean and standard deviation of the surface reflectance on the 

center die and across wafer on the off-line measurement set-up at different polishing 
stages. The effect of different loci is minimized by combining data from several loci, for 
instance from 5 different loci evenly across the wafer in this case. The effect of within- 
wafer nonuniform polishing on the variation of surface reflectance can be determined by 

25 comparing the difference between those two data sets. Before polishing, the mean 
reflectance across the wafer is higher than that on the center die because of the 
nonuniform coating from the Cu PVD process. The step-heights of patterns are found 
smaller at the edge dies and thus the average reflectance on edge dies will be higher than 
that of center dies. Hence the overall mean reflectance is smaller than that of the center 

30 die. Similarly, the standard deviation of the edge dies is generally smaller because 
trenches is more shallow due to the nonuniform Cu deposition. After polishing for a short 
duration, the overall mean became less than the average reflectance of the center die. This 
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is because the polished rate at the edge was faster than at the center, and the less reflective 
barrier and/or oxide layers were expose at the wafer edge. The standard deviation of the 
' reflectance across the wafer was also greater than that of the center with the increase of 
surface nonuniformity. More barrier and oxide layers were exposed and progressed from 
5 the edge toward the center with the increase of time. With the increase of the wafer-level 
nonuniformity, the difference between the two means and the standard deviations 
increased continuously. Until the majority of the dies reach the end-point, the mean 
surface reflectances across the wafer and at the center return to similar levels because the 
hard oxide layers retains the surface uniformity even with a slight overpolishing and the 

1 0 small dishing will not affect the reflectance significantly. The variation of the reflectance 
of the center die of the 4-minute sample is greater due to the remaining small patches of 
Cu/banier materials- In practice, the overall mean and variation of the reflectance may 
be compared with those on different surface areas (die-level zones) to determine the 
process endpoint. 

15 In-Situ Measurement on Patterned Wafers. 

An example of in-situ measurement on patterned Cu wafer is shown in Figure 28. 
The y-axis represents the raw data of normalized surface reflectance, which is defined as 
the reflectance measured divided by the reflectance on blanket Cu wafers before 
polishing. In the experiments, the angular velocity of the wafer was offset from the 

20 angular velocity of platen by 5 percent (a> w = 1 .05 a> p ) so that the loci covered the wafer 
surface. The moving average of the reflectance for ten passes and the standard deviation 
based on the pooled data from those passes are shown in Fig. 29. Compared with that 
from the off-line apparatus, the reflectance measured in polishing was lower because of 
light scattering by the slurry. It dropped approximately 20% to 25% in the planarization 

25 regime, but less significantly in the overpolishing regime. The mean decreased slightly 
right after polishing because of surface roughening. Then it started to increase until 
reaching a constant level around 1 minute after the surface had been planarized, as 
discussed in the earlier paragraphs. After 2 minutes, the mean dropped again because of 
the exposure of Cu on the surface. Since the Cu was removed nonuniformly due to the 

30 initial pattern layout and the variation of the coating thickness, the underlying oxide was 
gradually exposed on the surface and the mean dropped less steeply compared with data 
on a specific die, such as the center die in the earlier example. The onset of wafer-level 



WO 02/10729 PCT/US01/24146 

-38- 

endpoint was about 4 minutes in this experiment and the mean kept increasing, but at a 
slower rate, after the endpoint with the gradual increase of surface roughness due to 
- overpolishing and dishing. Since the effect of slurry and the lack of clear sign for 
endpoint indication, the mean can only serve as a rough indication of the onset of process 
5 endpoint. 

The standard deviation of the pooled data in the moving sampling set over ten 
passes is plotted versus time in Figure 30. Since the variation of the reflectance is mostly 
due to the pattern geometry and Cu area fraction, the distribution is generally not normal. 
The distribution of the normalized reflectance in terms of relative frequency is shown in 

1 0 Figures 3 1 a to 3 1 e, in which the distribution of reflectance from the off-line measurement 
is also shown in dash-line for comparison. There are two peaks of the standard deviation. 
The first peak occurs at the beginning of the process corresponding to the minimum mean 
reflectance in the Cu planarization regime, which resulted from the initial surface 
topography and surface roughening. The initial shape of the distribution remained similar 

15 to that measured off-line, which represents the initial surface topography of the wafer. 
The standard deviation in the planarization regime reached a minimum when the majority 
of the pattern has been smooth down and the mean reached a maximum. The surface 
condition at this stage is similar to that of a blanket wafer. The variation of the surface 
reflectance is affected by the surface roughness, slurry scattering and random error of 

20 measurement and thus represents a normal fashion in Figures 31b and 31c. The 
maximum variation of the reflectance occurs in the middle of Cu clearing regime, at about 
3 minutes of polishing in this case. A broad distribution with two peaks is observed in 
Figure 3 1 d. The subgroup of surface reflectance centered at a lower value represents the 
subdie area on which the oxide is exposed. The other subgroup with the mean close to 

25 the rough blanket surface indicates that the high reflective Cu and/or Ta barrier layers still 
partially cover the surface. After the maximum, the standard deviation decrease quickly 
with the increase of area of oxide exposure. At the onset of endpoint, he standard 
deviation reach a sharp turning point and then remain at a low constant level. As 
observed in the previous off-line measurement, the variation of the surface reflectance 

30 reach a minimum when the high reflective Cu is cleared. However, since the resolution 
of the sensor is limited by the spot size, it may not be possible to effectively detect the 
small patches of metals on the surface. In practice, a short period of overpolishing may 
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be applied to ensure that all the Cu/barrier materials are removed. After the endpoint, the 
standard deviation is determined by the designed pattern layout (local Cu area fraction) 
- which affects the skewness of the distribution. Therefore, the variation of surface 
reflectance will not change significantly with the small variation of surface topography 

5 resulting from overpolishing and dishing. 
Locus Design and Sampling Plan. 

The sampling scheme relies greatly on the design of sensor loci and sampling 
frequency to achieve an effective plan and provide reliable information of the underlying 
distribution of surface reflectance. At the die-level, many loci must be taken on the die 

10 of interest to detect the variation of reflectance due to the nonuniform topography, Cu 
area fraction and the non-symmetric layout. Based on the kinematics, the sensor locus 
is determined by the parameters of g>„ a> pt r 5 , and r cc . For some conditions, such as the 
example in Figure 5 with co w = 1.05 co p and r s = r cc , the sensor can cover the center die 
with multiple scans but maybe with only pass the edge die with one or even none. One- 

15 way to improve the sampling density on the edge die is to increase the number of loci on 
the wafer by reducing the offset between co w and co p . However, this will increase the time 
period to scan one revolution over the wafer surface and thus may delay the detection of 
rapid changes of reflectance of a local area. The wafer slippage, both rotation and 
translation inside the recess, will also make the control of velocity offset within a small 

20 range very difficult. In reality, the smallest offset of the wafer and the platen velocity is 
about 3% to 5%, typically. 

On the other hand, the distance between centers of the wafer and the platen r s may 
be changed during the polishing. This "sweeping motion" may help cover over a desired 
region on the wafer surface. Figure 32 shows an example at r s = 1 .25 r cc with co w = 1 .05 oo p 

25 and r cc = 0, in which only the outer area is sampled. Compared with the high sampling 

density at the center in Figure 18, the sampling density is much higher and uniform 
around the edge now. In practice, the entire wafer may be scanned first to roughly 
determine the overall surface condition, then the area at a particular radius of interest can 
be scanned with a higher sampling density for a better inference of the local condition. 
30 Moreover, two or more sensors can be installed at different radii r s and different angles 
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(phase) on the same platen. The combined loci will give a higher and more uniform 
distributed sampling density of both the center and the edge region. Another important 
* parameter for designing the sampling plan is the sampling frequency. In order to detect 
the variation of reflectance between the different sub-die areas and different dies, at least 

5 one data must be taken from each subdie along the sensor locus. It is preferable to have 
one or more replicants on each pattern to reduce the error due to random variation in 
measurement. For the 100 mm patterned wafer employed, about 40 subdies are located 
along a locus (ten dies along a locus with 4 subdies across each die diagonal). With at 
least one replication on each subdie area, totally about 1 00 points are required in the test, 

10 which corresponding to 100Hz sampling rate at the typical wafer rotational speed of 60 
rpm. Nevertheless, the sample size can be larger and more replicants can be taken to even 
out the effect of random error if the data acquisition system can provide a higher sampling 
rate. 

Variance Components of the Surface Reflectance. 

15 The surface reflectance of a patterned wafer varies with the surface roughness, 

pattern topography and area fraction, and the optical properties of coating materials. Due 
to the within-wafer nonuniform material removal, the surface topography and the 
remaining fraction of Cu during polishing may vary among different dies across the 
wafer. The within-wafer nonuniform polishing usually results from certain systematic 

20 sources, as nonuniform velocity distribution, pressure distribution, interfacial temperature 
distribution, slurry flow and contact conditions (Stine, 1998). Its effect on polishing 
always follows a systematic pattern and tends to be repeatable between wafers in the same 
lot. On the other hand, the wafer-level nonuniformity affects the pattern evolution on the 
same die with a similar trend. The relative rates of material removal between different 

25 patterns on a die will remain similar to another die at different location because the factors 
that affect wafer-level nonuniformity will have less interaction with the die- or device- 
level polishing behavior. For instance, the die-level polishing is mostly affected by the 
pattern geometry, such as linewidth and area fraction. Therefore, the variation of 
reflectance measurements on a die tends to follow the same distribution and is nested 

30 within the die. Based on this assumption, a two-level nesting variance structure is 
employed to decompose the effects of within-wafer and die-level nonuniform polishing. 
Assuming that the variance at each level is normally distributed, the reflectance at 
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location j of die i on the wafer, R ip can be written as 

Xy=M+*l+D m (23) 

where ju is the average reflectance within a wafer from multiple loci, W t the die-to die (or 
within-wafer) effect on die i, and D m the within-die effect at location ; on die L The total, 

5 within-wafer and within-die variances of surface reflectance are expressedas o£, o^ 9 <J D 

respectively. Additionally, the within-die effect, D j(i)9 is assumed to be normal and the 
two-level variance components are assumed to be independent to each other. Therefore, 

the total variance of reflectance, <T r , can be written as 

0*=0^+0> D (24) 

10 The results of decomposition of estimated variance components, and with respect 
to the in-situ measured data are plotted in Figure 33. The value of each component and 

the F ratio, defined as for every 30 seconds are listed in Table 3 to examine the 

significance of within-wafer nonuniformity on the variation of surface reflectance. 
Additionally, the polishing results for all dies at the same radius are assumed to be similar 

1 5 and are combined into a subset for estimation of the die-level variation. The high F ratio 
on the wafer before polishing indicates that the within-die means at different radii are 
different and the probability of mean difference between dies, Pr(F) (which implies the 
existence of within-wafer nonuniformity), is about than 0.6. This is due to the variation 
of initial step-height from the deposition process. The within-wafer nonuniformity 

20 decreases after polishing starts, and remains at a low level with respect to the total 
variation. The confidence level of the hypothesis - there is a mean difference between the 
dies - is less 20 %. This suggests that the surface is planarized (or topography becomes 
more uniform across the wafer) by polishing. The within-wafer variance and the F ratio 
even drop to very low levels, (Pr(F) ~ 0), after the wafer-level endpoint is reached. This 

25 is because the underlying oxide surface is harder than Cu and can retain the surface 
planarity and the wafer-level polishing uniformity. On the other hand, the within-die 
effect contributes significantly to the total variation of surface reflectance throughout the 
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process. The process endpoint can be determined based on the change of within-die 
variance component as a result of the drastic change of Cu area fraction. In practice, the 
- total variance might be employed to approximate the within-die variance to determine the 
process endpoint. The small effect of within-wafer nonuniformity will not affect the 
5 accuracy of detection. 



Table 3: Analysis of variance of two-level nested model for surface reflectance. 



Time 
(Minutes) 


Within-Wafer 
Variance, ^ 


Within-Die 
Variance, 5? 


F Ratio 


Pr(F) 


0 


15.94X10" 4 


1.64xl0- 3 


0.965 


0.59 


0.5 


3.89 


2.62 


0.149 


0.07 


1.0 


2.62 


1.58 


0.166 


0.08 


1.5 


3.88 


1.54 


0.252 


0.14 


2.0 


7.49 


2.51 


0.299 


0.17 


2.5 


9.30 


8.45 


0.110 


0.05 


3.0 


9.22 


18.11 


0.051 


0.02 


3.5 


7.24 


13.67 


0.053 


0.02 


4.0 


1.39 


3.08 


0.045 


0.01 


4.5 


0.15 


1.01 


0.015 


~0 


5.0 


0.01 


1.04 


0.001 


~0 



20 Moreover, it may be noted that the within-wafer variance is just a indication of the 

nonuniform reflectance of the surface. It may not be directly correlated with the 
uniformity of the remaining Cu thickness. However, it directly represents the uniformity 
of surface condition. This information can be employed to monitor the across-wafer 
surface condition and uniformity. It may also be employed in a feedback control loop to 

25 adjust the process parameters, such as pressure distribution and velocities of wafer carrier 
and platen, to improve the uniformity of polishing. 
Endpoint Detection Algorithms. 

In previous sections, the characteristics of surface reflectance at endpoint and other 
stages of Cu polishing, in terms of moving average, distribution and the variation of the 

30 reflectance across the wafer, were discussed. These characteristics can be employed to 
design the endpoint detection algorithm(s). The moving average can be employed to 
detect the moment that the surface reflectance drops under a certain threshold as shown 
in Figure 29. The threshold is determined by the average area fraction of Cu and the 
optical properties of surface materials with respect to the wavelength(s) employed. 

35 Because of the random effect of slurry scattering, surface roughness and random error, 
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the threshold usually will deviate from the theoretical mean reflectance presented in the 
earlier section and must be determined based on the observations from a few preliminary 
fc tests. Moreover, the sampled reflectance corresponding to the "true" wafer-level endpoint 
will fall into a statistical distribution related to the variation in initial coating uniformity, 
5 the variation of process parameters and the random error from sampling and sensing. 
Accordingly, a hypothesis test must be conducted to ensure that the moving average M 
falls within a given interval with respect to an acceptable confidence level. Since the true 
variance of fhe surface reflectance is not known, the 100(1 -a) confidence interval is 
determined using the appropriate Student t sampling distribution for the sample standard 
1 0 deviation S (Montgomery, 1 996). 



v 



M+L 



(20) 



Figure 34 shows the results of the moving average of the surface reflectance versus time 
with an estimated interval at 99.5% confidence level (a = 0.005). Since the sample size 
N is very large, the estimated true mean is confined to a small interval. Moreover, the 
15 threshold may also have its underlying distribution from the historical data. It may be 
ambiguous sometimes to determine the endpoint from the overlapping of the two 
confidence intervals. The threshold also varies with different chip layout and design. It 
may be time-consuming to develop a new endpoint detection recipe for every change or 
new chip design. 

20 Compared with the moving average, the variance (or standard deviation) of surface 

reflectance provides a more robust means to detect the endpoint. The variance shows a 
clear change at the onset of endpoint in Figure 30. The endpoint can be determined based 
on both the slope and the threshold level of the variance curve. Because of the high 
reflectance difference between Cu and oxide, the change of variance with time is usually 

25 much drastic right before the endpoint for any chip design. The variance of surface also 
remains at a low level after the endpoint because the oxide with high selectivity will retain 
the surface uniformity. Similarly, the variance can be estimated from the measurements 
based on a desired confidence interval. Without knowing the true variance of the surface 
reflectance a 2 , the variance interval with 100(l-a) confidence level is given based on the 
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Chi-square (x 2 ) distribution. 

2 2 (21) 

Xal2,N-\ %\-a/2,N-\ 

It is shown that the estimated variance does not vary significantly within a short period 
of overpolishing. The threshold of variance will also approximately remain a constant 
5 between runs for a given pattern design. Therefore, the endpoint is much easily 
determined based on the variance information than from the mean (moving average). In 
practice, the ratio of standard deviation to the mean reflectance can be employed to 
incorporate the characteristics of mean and variance of reflectance for endpoint detection, 
as shown in Figure 35. The endpoint is indicated as a local minimum and can be 

10 determined without the complexity of calculating the slope and the confidence intervals. 

In addition to the wafer-level endpoint, the onset of endpoint on the dies can also 
be determined based on mapping of sampling loci onto the wafer surface. The surface 
conditions on different zones, such as "rings" at different radii, can be determined based 
on the same techniques employed in the wafer-level endpoint detection. The sampling 

15 loci can be designed as described in the earlier section to select the sensing area and 
resolution. Moreover, the mean, variance, and distribution of the surface reflectance also 
provides information for different stages in the polishing process. The variance and the 
variance to mean ratio reach a minimum, and the distribution becomes normal when the 
Cu pattern is planarized. The range of the reflectance increases drastically when the 

20 underlying oxide starts to expose, as shown in Figure 36. The variance to mean ratio 
reaches a maximum when the majority of the excess Cu on the surface is cleared. This 
information can be integrated as part of the in-situ sensing technique to determine 
progress of the CMP process. For multi-step polishing processes, this information can 
also be used to determine the endpoints of each step and increase the capability of process 

25 control. An experiment was conducted to validate the effectiveness of various endpoint 
detection scheme with the same process condition listed in Table 2. Polishing was 
stopped as soon as the standard deviation, the standard deviation to mean ratio, and the 
range indicate the onset of (wafer-level) endpoint, as shown in Figure 37. Pictures of the 
wafers were evaluated and agree with the results achieved by the sensing system, and it 
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was observed that Cu is cleared up on the surface. Although it is hard to identify from 
observation, an ultra-thin Ta barrier, which is more transparent to the light than the thick 

i. 

* layer, may still remain on the surface and may not be detected by the optical sensor. In 
practice, a short period of oveipolishing may be applied after the sensor detects the 
endpoint to ensure that all the metals are completely removed. 
Nomenclature - the following nomenclature is used in the preceding sections: 





A f 




area fraction of metal pattern 




H 




hardness of coating material (N7m z ) 




TT* 

H 




apparent hardness of a composite surface (N/m 2 ) 


10 


h 




thickness of the material removed on wafer surface (m) 




K 




lmtial coatmg thickness (m) 




t 

K 




Preston constant (m z /N) 




k 





wear coefficient 




Pav 




nominal nressure on wafer fN/m^ 




D 
F 




average prca»ure on apaiiem ^iN/ni j 




r 




ranuom error in inicKness measurement yn\ ) 




t 




experiment duration (s) 




t* 




oveipolishing duration (s) 




Vr 




relative linear velocity of wafer (m/s) 


20 


W 




pattern linewidth (m) 








Cartesian coordinates ( m) 




x,y,z 






Ah 




oxide oveipolishing (m) 




8 




Cu dishing (m) 




I 




pattern pitch (m) 


25 


M 




average oveipolishing on a die 






dimensionless geometrical function 




V 




Poissoris ratio 



As taught by the foregoing description and examples, an improved method 
apparatus for chemical mechanical polishing of semiconductor wafers has been provided 
30 by the present invention. The foregoing description of specific embodiments and 
examples of the invention have been presented for the purpose of illustration and 
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description, and although the invention has been illustrated by certain of the preceding 
examples, it is not to be construed as being limited thereby. They are not intended to be 
* exhaustive or to limit the invention to the precise forms disclosed, and obviously many 
modifications, embodiments, and variations are possible in light of the above teaching. 
It is intended that the scope of the invention encompass the generic area as herein 
disclosed, and by the claims appended hereto and their equivalents. 
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WHAT IS CLAIMED IS: 

1 . A chemical mechanical polishing (CMP) apparatus comprising: 
a rotating polishing platen having a first diameter, 

a wafer carrier for holding a wafer in cooperative relationship with said rotating 
platen, said wafer carrier having multiple chambers that allow for independently varying 
pressure within the chambers that urge against the wafer at corresponding multiple 
localized zones on the wafer, 

at least one window formed in said polishing platen whereby said window is 
periodically scanned across a wafer, and 

an optical detection system carried on said platen for transmitting light through 
said window and receiving light reflected from the wafer through said window as it 
rotates past the wafer, to detect the reflectance of materials on the surface of the wafer 
at the multiple localized zones. 

2. The CMP apparatus of claim 1 wherein the reflectance is used to stop the 
polishing independently within each of the multiple localized zones. 

3 . The CMP apparatus of claim 1 wherein the reflectance indicates the state 
20 of polishing of the wafer within each of the multiple localized zones. 

4. The CMP apparatus of claim 1 further comprising: 

a controller, which receives reflectance signals representing the reflectance of 
materials on^the surface of the wafer at the multiple localized zones from the optical 
25 detection system, and said controller is configured to process said reflectance signals to 
determine the state of polishing within each of the localized regions, and to selectively 
vary the pressure independently within each of the multiple chambers responsive to said 
state of polishing determination. 

♦ 

30 5. The CMP apparatus of claim 1 wherein said multiple chambers are formed 

in a flexible membrane and comprise a center chamber surrounded by one or more 
concentric chambers. 
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6. The CMP apparatus of claim 1 wherein the multiple chambers comprise 
a center circular chamber and three annular, concentric chambers. 

7. The CMP apparatus of claim 1 wherein said optical detection system 
5 further includes at least one fiber optic sensor having a bundle of transmit and receive 

optical fibers terminating at a sensor tip, a light source which transmits light through the 
transmit optical fibers to the surface of the wafer, and a photodector which receives 
reflected light from the surface of the wafer through the receive optical fibers. 

1 0 8. The CMP apparatus of claim 7 wherein said transmit and receive optical 

fibers are oriented substantially normal to the surface of the wafer. 

9. The CMP apparatus of claim 7 wherein the sensor tip is spaced apart from 
the surface of the wafer to form a gap, and the size of the gap is in the range of about 200 

15 to 250 mils. 

1 0. The CMP apparatus of claim 7 wherein the light source is a light emitting 
diode which emits light at a wavelength of about 880 nm. 

20 11. The CMP apparatus of claim 1 wherein the materials on the surface of the 

wafer are any one of, or a combination of, conductive, insulating or barrier materials. 

12. The CMP apparatus of claim 1 1 wherein said materials may be patterned 
on the surface of the wafer. 

25 

13. The CMP apparatus of claim 1 wherein the window scans through the 
center of the wafer. 

14. A method of chemical mechanical polishing (CMP) of a semiconductor 
30 wafer, comprising the steps of: 

providing a CMP machine which includes a polishing pad and a wafer carrier 
having multiple chambers that allow for independently varying pressure within the 
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chambers that urge against a wafer at corresponding localized zones on the wafer; 

measuring the reflectance of the surface of the wafer during polishing at each of 
% the localized zones on the wafer; 

processing the reflectance data to determine the state of polishing within each of 
5 the localized zones; and 

independently adjusting the pressure within any one of the chambers responsive 
to the state of polishing within each of the corresponding localized zones. 

15. The method of claim 14 wherein the step of independently adjusting 
1 0 further comprises: 

reducing or stopping the chemical mechanical polishing, independently within 
each zone when a change in the reflectance is measured in that zone. 



16. The method of claim 15 wherein the chemical mechanical polishing is 
1 5 reduced or stopped in a zone when the change in reflectance is in the range of about 25 

to 60 %. 

17. The method of claim 15 wherein the chemical mechanical polishing is 
reduced or stopped in a zone when the change in reflectance exceeds a predetermined 

20 threshold value. 

18. The method of claim 14 wherein the step of independently adjusting 
further comprises: 

reducing or stopping the chemical mechanical polishing, independently within 
25 each zone according to prior reflectance measurements. 

1 9. The method of claim 14 further comprising: 
detecting the amount of scattering in the reflectance data; 

determining the degree of topographical variations on the surface of the wafer 
30 based on the amount of scattering at the localized zones; and 

controlling the polishing process at the localized zones on the wafer responsive 
to said topographical variations. 
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Scematics of light scattering on 
a patterned Cu surface. 

FIG. 14 
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Schematics of light scattering from 
a planar composite surface. 

FIG. 15A 
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Schematics of light scattering from 
a wavy composite surface. 

FIG. 15B 
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Sensor kinematics. 

FIG. 16 
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Results of in-situ measurements of the moving average and 
standard deviation of wafer- level surface reflectance. 
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